CONVERGENCE RATES FOR AN OPTIMALLY 
CONTROLLED GINZBURG-LANDAU EQUATION 



MATTIAS SANDBERG 

Abstract. An optimal control problem related to the probability of 
transition between stable states for a thermally driven Ginzburg-Landau 
equation is considered. The value function for the optimal control prob- 
lem with a spatial discretization is shown to converge quadratically to 
the value function for the original problem. This is done by using that 
the value functions solve similar Hamilton-Jacobi equations, the equa- 
tion for the original problem being defined on an infinite dimensional 
Hilbert space. Time discretization is performed using the Symplectic 
Euler method. Imposing a reasonable condition this method is shown 
to be convergent of order one in time, with a constant independent of 
the spatial discretization. 
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1. Introduction 

We shall in this paper study the convergence of the Symplectic Euler 
scheme for approximating optimal control of the real Ginzburg-Landau equa- 
tion. This follows the work developed in [20], where a convergence result 
for the value function to an optimally controlled ODE is obtained using the 
corresponding Hamilton-Jacobi equation. As there exists a rigorous the- 
ory also for infinite-dimensional Hamilton-Jacobi equations, developed by 
M. Crandall and P.-L. Lions [7-13], it is possible to perform a convergence 
analysis for a spatial discretization of an optimally controlled PDE, using 
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Figure 1.1. Upper curve: 99+. Lower curve: 

that the value function is a viscosity solution to an infinite-dimensional 
Hamilton-Jacobi equation. In this paper the analysis is performed for the 
specific problem at hand, but hopefully the analysis is clear enough to make 
adaptations to other circumstances (fairly) easy. 
Consider the stochastic PDE 

^t = 5^xx-5-^V'{^) + ^e7], in [0,r] X [0,1], (1.1) 

where 5 is a positive number and r] is white noise in two dimensions; this 
means that ry is a random Gaussian distribution with zero mean and covari- 
ance 

E{i]{x, t),r]{x', t')) = 5{x - x')5{t - t'), 

where E denotes the expectation and 6 the Dirac delta distribution. The 
"state" variable ip satisfies the Dirichlet boundary conditions 

ip{t,0) = ip{t,l) = 0, 

and V is the "double-well" potential 

see Figure [TT2I In one space dimension the stochastic PDE (jl.ip makes sense, 
as existence and uniqueness of solutions can be proved. Taking e = 0, the 
solutions to (jl.ip generically approach one of the two stable critical points, 
or (/?_ , (see Figure II. ip , which constitute minima to the energy 

l\^-^l + 5-'V{^))dx. 

With a small e, the solutions to (jl.ip spend most of the time in the vicinity 
of either ip^ or (/?_, but as rare events make the transition from one to the 
other. The equation (jl.ip may therefore be taken as a model for thermally 
driven phase transitions, nucleations, etc. 
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The probability of jumping from to ip- in the finite time T is related 
to the action functional 

I{^) = \ r l\ipt-5^xx + 5-^V'{ip)fdxdt. (1.2) 
^ Jo Jo 

Introduce the probability Pt that a solution ip to (jl.ip . with ip{0) = ip^, 
satisfies ip{T) S S, where S is an open subset of the set of continuous 
functions on the spatial interval [0,1]. Theory of large deviations in [17] 
gives that 

-I{S) < liminfelogPT 

and 

lim sup e log Pt < —I{S) 

where 

/(S)=inf/(v9), 

with the infimum in the last equality taken over all continuous functions (/? 
in [0, r] X [0, 1] starting at 99+ and ending in S, and where S is the closure of 
S. By taking S a small neighborhood of ip- we can for small e approximate 
the probability of transition from 99+ to ip- with 

In [15] the minimization of ()1.2p is performed for p{0) = and p{T) = 
ip- using optimization of a finite difference approximation. In this paper 
(p{T) will not be held fixed, but instead a penalty cost at the final time 
is added to the functional ()1.2p in order to force the solution to end up 
close to (p-. The optimal control problem which will be considered here is 
the following. Minimize, over all a G L^(0, T; L^(0, 1)) , the value v^^fi{a), 
where the functional v is defined by 

VvoM= [ h[a{t))dt + g[^{T)), (1.3) 

J to 

and where is a mild solution to 

= 5P>XX - 5"V'((/?) + Q, ip(tQ) = ipQ. (1.4) 

In order to define mild solutions we denote by S{t) the contraction semigroup 
of bounded linear operators on L^(0, 1) generated by 5d'^/dx^ defined on 
F(}(0, 1) n ^2(0, 1). A mild solution to ([131) is a function <p G C(to, T; L^) 
such that, for all to < t < T, 

^it) = Sit-to)ipo+ [ Sit-s){-5-Wi^is)) + a{s))ds. (1.5) 
Jto 

In the appendix existence and uniqueness of weak solutions in C{to, T; Hq) of 
(II. 4p is proved when the starting position po G Hq{0, 1). Such weak solutions 
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Figure 1.2. The original potential V is drawn with the solid 
line. The modified potential V coincides with V in the inter- 
val [— s, s] and is drawn with dashed lines outside that inter- 
val. 

are also mild solutions (this can be seen by using e.g. the calculation on 
page 105 in [19]). Furthermore, with a bounded in L'^{to,T; L'^), the weak 
solution is bounded in C{to, T; Hq). Hence, the potential V may be changed 
outside an interval [— s, s] without changing the result of the optimal control 
problem. For simplicity, we shall henceforth use the potential V in Figure [L2] 
and quickly change notation, so that we let V = V, i.e. V is given by the 
dashed line. Letting the transition from the interval [—s, s] to the outside 
be a smooth one we can assume that arbitrarily many derivatives of V are 
bounded. When the function V' is bounded in supremum-norm and the 
control, a, is bounded in L'^{tQ,T] L^), uniqueness of mild solutions to (|1.4p 
holds; see [4]. For starting positions ipo G Hq{0,1) it therefore holds that 
mild solutions and weak solutions are the same, and for the analysis either 
concept of solution may be used. 

The running cost, h, corresponds to the action functional ()1.2p as 

h{a) = ||a||^2(o,i)/2, (1.6) 

and the final cost is the squared distance from ip^, 

g{ip) = K\\^ - ip_\\l2^Q^^y (1.7) 

where K is a constant large enough to force ip{T) ~ (^_. We denote by u the 
value function, i.e. the best possible value of ()1.3p for each starting position 

u(v?o,io) =mf{^;^o,to(a) : a e {to,T; L\0,1))} (1.8) 

Notation: We henceforth let || • || and (■,•) be the norm and inner 
product on (0, 1), and | • | be the supremum norm on R. The Dirac delta 
distribution will be denoted 5, as 6 is used as the diffusivity constant. 
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Outline: Section [2] contains some facts regarding the value function, which 
are apphed in Section [3] when the error from the spatial discretization is es- 
tablished. In Section [4] convergence of the time discretization using the 
Symplectic Euler method is examined. Under a reasonable condition, this 
method is shown to be convergent of order one, with a constant independent 
of the spatial discretization. Numerical results with examples of the conver- 
gence rate for discretization in both space and time is given in Section O 

2. Preliminaries 

This section contains some results which will be needed when the spa- 
tial discretization error bound is established in Section [3l We start with a 
theorem about the boundedness of optimal controls. 

Theorem 2.1. For all (po £ Hq{0, 1) and < tQ < T the value function u 
satisfies 

^ilv'Ojio) = inf {u^oA)(«) : l|a|lL-'(io,T;L2) < -E^llv^oH + 

where the constants E and F depend on 6, K , T , \V'\ and \ V"\, hut not 
on ipo and to- 

Proof. It is first shown that with a{t) = 0, for all t, the state variable at 
the terminal time, f{T), is bounded in by a constant which depends on 
the starting point (pQ. This can be done by taking the inner product with 
If in (II. 4p . using ^ 0, and noting that the function t i— > ||y'(t)|P is 

absolutely continuous with {ip,ipt) = ^||(/?|p/2 almost everywhere in [to;^]- 
Hence 

^^\My2<-5-\V\^),^)<6-'\V'\-\M, 
almost everywhere in [tQ,T], and thereby 

j^\M < S''\V'\. (2.1) 

By the fact that tpt is bounded in L'^{tQ, T; L^(0, 1)) (see [16]), it follows that 
the function 1 1— > ||(/?(t)|| is absolutely continuous, and therefore (j2.ip implies 
that ||(/9(r)|| is bounded by ||</?oll +5~^|y|T. Hence the final cost, g[(p{T)), 
is bounded in terms of the starting position: 

9{^{T)) = KMT) - < 2K{MT)\\' + 

< AKW^poW^ + AK6-''^\V'\'^T^ + 2K\\ip^\\'^ =: M. 

It therefore holds that 

u{ipQ,to) = inf {wvpciol") : \Mh{to,T;L^) ^ ^M}. 

For all a bounded by \/2M in L'^{tQ,T; L"^) we have that (p{T) is bounded, 
again by taking the inner product with ip in (|1.4p : 

~\M'<s-'\v'\-\M + \\a\\-\M, 
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which imphes ^||<^|| < + ||a|| and so 

mm < ||(^o||+<5-^|V^'|r+ / llalldt 

J to 

< IIV'oll + ^'^\y'\T + VT\\a\\L2(^t^^T.L2'^ 

< \\ipo\\ +6'^\V'\T + V2TVm < E\\ipo\\ +F, 

for some constants E and F which do not depend on ipo and to- 

It shah now be proved that changing the control a a smah amount changes 
the state a small amount. We shall therefore compare two solutions, ip^ 
and (/?^, both starting at ((/joj^o)) such that ip^ solves (jl.4p with control 
and (/?^ with control a^. Subtract the two evolution equations and take the 
inner product with (p^ — ip"^ to obtain 

l II 1 2||2 I r|| 1 2||2 

2^11^ II +^m-^.\\ = 
which, by the boundedness of V" , entails 



' ^^\\<6-'\V"\-\\ip'-^-'\\ + \\a'-a^\ 



dt 

By Gronwall's lemma we therefore have that 

miT)-ip\T)\\<exp{5-'\V"\T) f \\a^ - a^\\dt, 

J to 

so, provided and are both bounded by V2M in L'^{to,T; L"^), the 
difference in terminal cost has the following bound: 

\g{ip\T)) - g{ip\T))\ = K\\\ip\T) - ip.\\' - \\ip\T) - ^.\\'\ 

= K\{ip\T)+^\T) - 2^.,^\T) - ^\T))\ 

< 2K(i?||(^oll + + llv^-ll) • ll^'(r) - vHni 

<R \\a^ - a'^Wdt, 

Jto 



where 



i? = 2Kexp(rV'|r)(^ll<^o|| +i^ + ||¥'-ll) 
=:E'm\\+F\ 

(2.2) 

with the constants E' and F' independent of ipo and to- Let now be a 
control bounded by \/2M in L'^{tQ,T; L"^) and let be a modification of 



a': 



a^(t), for ah t such that ||a^(t)|| < 2R, 
0, otherwise. 



OPTIMAL CONTROL FOR GINZBURG-LANDAU 

The difference in the terminal cost thus has the bound 

\g{^\T))-g{^\T))\<Rf \\a^\\dt, 

J{t:\\a^{t)\\>2R} 

while the difference in running cost is 

dt>R \\a \\dt, 



l{t:\\aHt)\\>2R} ^ J {t:\\a^{t)\\>2R} 

so 

Hence for any control bounded in L?'{tQ,T;L?') there is another control, 
bounded by 2E'\\^pQ\\ + 2F' in L°°{to,T; L^), which gives a smaller or equal 
value functional. □ 

With Theorem 12.11 the theory in [4] may be used, which establishes exis- 
tence of optimal controls to u in (|1.8p . We state this in a corollary. 



Corollary 2.2. For each {ipQ,to) G Hq{0, 1) x [0,T] there exists a minimizer 
a, bounded in L°°{to,T; L'^), in ([TT 



Proof. Use Theorem [2?T] and [4] . □ 

Theorem l2.1l is also used when proving Theorem 1 2 . 3 1 ab out semiconcavity. 
In [3] a theorem on semiconcavity on L^(0, 1) x [0,T) is established. This 
result could have been used in this paper, but as only the weaker result of 
semiconcavity on Hq{0, 1) x [0, T] is needed for our purposes, an easier proof 
is given for this case. 

Theorem 2.3. The restriction of the value function, u, to Hq x [0, T] is 
semiconcave. 

Proof. It will be shown that for every constant B, every closed interval 
/ C [0,T), and all starting positions {ipQ,t^) and ((^q,*^) with | |<^ol l-ff^(o i) + 
II'/^oIIhq(o 1) — ^ G /, there is a constant C such that 

1 1^) - 2u( ^^i±^, t±^) < C{U - ^IWl, + \t' - t^\^). 



u{ipQ,t ) + u{ipf^,t 

In order to keep constants simple we use that u may be defined in Hq{0, 1) x 
(— cxD, T], so that we may set t^ = h and t'^ = —h, and realize that the result 
for other times follows analogously. In this proof we let C be any constant 
which may depend on B. 

Let a : [0,T] be an optimal control for the cost functional v^i^^2 

defined in ()1.3p . and let (/^^ : [0, T] Hq be the corresponding state. Define 
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controls for solutions starting in ((^J, h) and {ip^, —h) by dilations of a as 

t - h 



T-hJ' 



a [t] = a\ 1 — — - , , 



and let the corresponding states be denoted ip^ : [h,T] Hq with (p^{h) = 
ifQ and 99^ : [—h,T] — > Hq with ip^{—h) = (/9q. The evolution equation 
(jl.4p for if^ and (^^ is now transformed to the interval [0, T]. The following 
equations are thereby obtained: 

^\ = ^i5ipl, - S-'V'{^') + a), ^i(O) = ^l, 



= -^{d^L - S~'V'{v') + a), ^^(0) = ift. 



The function 

z{t) = ip\t) + ip'^{t) - 2ip^{t) 

is now introduced. We will obtain a bound for ||z(T)||. The equation solved 
by z is 

+ Y^^' - + Y^v'i^') - V'{^')). (2.3) 

It is therefore necessary to find a bound for 99^ — 99^. The evolution equation 
for (p^ — (p^ is 

((^1 - = 5{p^ - ^^.^ - 5-\V'{p') - V'{^^)) 

- f (^^ + + Y^V'i^') + V'ip')) - fa. (2.4) 

After the inner product is taken with ip'^ — p^ the following inequality is 
obtained: 



-'-'^11 1 2||2^x-llT^"llll 2||2i'^^||l I 2||||1 2| 

-j^Wp -P^W <5 \V \ -\\p -p\\ + yIIv'xx + • Ik 



+ ^-^\\v'{p') + v'{p')\\ • 11^1 - 99^11 + • 11^^ - ^'11, 
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and hence 

+ + f INI- 

Thus, by Gronwall's Lemma, 

\\^\t)-^'{t)\\<e'~'\'"'\Vm-^\0)\\ + 

Since | |<^ol 1^,^(0,1) + I Iv'ol 1/^^(0,1) ^ B it follows that y?^ and cp'^ are bounded 
by a constant C in L^(0,T;//^); see [16]. Together with the fact that V' is 
bounded this implies that 

\\^Ht)-^\t)\\<C{\\^l-^l\\+h), forallO<t<T. (2.5) 

Equation (j2.4p is now used once again together with the fact that — 
^'(v^)! ^ ■ ~ V'^l ^^'^ Theorem 5 on page 360 in [16], to draw the 
conclusion that 

esssup||v9^(i) + W^l^ - (pIJ\l2(o.T;L2) < CiWipl- iflWHi +h). 

0<t<T ° ° 

(2.6) 

There is also the term V'{ip^) + V'{<p'^) - 2V'{ip^) in ([23]). This can be 
handled as 

|FV) + y'(v.2)-2yV)l 

< W'i^') + V'{^') - 2V'i^^^)\ + 2|y'(^^) - V'{^')\ 

<^-^\^'-^r + \V"\-\^\- (2.7) 
We are now ready to take the inner product with z in (|2.3p to obtain 

< f\^' - ^r\z\dx + s-'\v"\ . \\z\ 



Jo 

+ fll^L-9^Lll-INI + in^ll^^-^ 

which implies 

|||.||<r^in-INI + ^^ll^^-v^^lli.(o,i) 



,2 1 
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By Gronwall's Lemma 



+ Y\\^L-^U + \V''\^-^\\^'-^'\\)dt. (2.8) 



Sobolev's inequality gives that — '/?^||l4(q_i) < C'H^J"'^ — (/9^||j|^i(o i), so (|2.6p 
together with (j2.8p imphes that 

\\z{T)\\<C{\\vl-^l\\l,^^^^^ + h'). 

This fact is now used to show that 

+ v^l-hi(^') - («) < C{\\^1 - iplW" + h^), (2.9) 

2 

The terminal cost is treated first. We use the notation ipT = ^(T) and 
perform a simple rearrangement: 

<C{\\^l-^l\\l,+h^), (2.10) 

where (j2.5p . (j2.8p . and the fact that Lp}p^ (p^ and (/Jj,, are bounded are used. 
The running costs must also be treated. A simple calculation shows that 

/ \\a^\\^dt+ [ \\a'^\\'^dt-2 [ \\a\\'^dt = 0. (2.11) 

Jh J-h Jo 

The desired result I^M) follows from ([230l) and (I2TTD . □ 

3. Discretization in space 

We shall compare the value functions associated with our original problem 
and a finite element approximation. The value function we want to approx- 
imate is u defined in (jl.Sp . The approximate value function is, similarly as 
in (HI 



u{(po,to)= inf \g{ip{T))+ h{a)ds : ip{to) = (po\, (3.1) 

where (p G C{tQ,T;V) solves 

{^t,v) = -5i^,,v,) + {-6-^V'{^) + a,v), forallt;Gy, (3.2) 

and V is the space of continuous piecewise linear functions on [0, 1] which are 
zero at and 1 and linear on the intervals (0, Ax), (Ax, 2Ax), and so on. We 
note that the infima in ()1.8p and (13. Ih are attained, using Corollary 12.21 for 
the original problem (jl.8p and the easier theory in [5] for the approximation 
problem (|3.ip . Therefore we can replace the infima with minima. The 



OPTIMAL CONTROL FOR GINZBURG-LANDAU 



11 



same sort of convergence analysis which is presented here is performed for 
problems of optimal design in [6] . 

We now introduce some notation needed in Theorem 13.11 We denote by 
P the projection from L'^{0, 1) to V or from L'^{0, 1) x M to ^ x R. Let 
be an open subset of a Hilbert space X, and z : — > M. For any xq £ ^ 
the superdifferential D^z{xq) is defined as follows: 

n+ ^ ^ f r- Y\r z{x) - z{xo) - {p,x - xo)x , ^\ 
D^z{xo) = <p £ X \ lim sup j j < > . 

x^xo \x-Xo\x > 

The Hamiltonian, i7, for the optimal control problem p.8|) is given by 

H{\ip) = -5(A.,(/,.) -<5-i(A,y'M) - ||A||V2, (3.3) 

for all A, (/3 G -f^o (0, 1). The restrictions of u to the subspaces V x [0, T] and 
Hq X [0, T] are denoted uy and uh- 

Theorem 3.1. Let ipo £ V. Denote an optimal pair (control and state) for 
uitpo^to) by a : [to,T] — > and ip : [to,T] —^ and an optimal pair for 
u{ipo,to) by a : [to,T] — > V and (p : [to,T] — > V. Then 



^ {pl{s)+H{Pp*^{s),^{s))ys 

< u{ipo,to) - u{Lpo,to) 

< g{Pp{T)) - g{^{T)) + {h{p*{s), P^{s)) - H{p*{s), p{s))) 



(3.4) 

where p*{s) = (p* (s),p*(s)) G i^(0, 1) x M is any measurable function with 
values in D~^uh{'p{s), s) , andp'^{s) = {p1^{s)^pf {s)) G y x M zs any mea- 
surable function with values in D~^u(Pip{s),s). 

Proof. We divide the proof into two steps: In Step 1 we obtain a lower bound 
for n((/7o, to) —uiipojto), and in Step 2 we do likewise for u{ifo,to) —u{ifo,to). 

Step 1. Using the definitions (jl.Sp and (|3.ip for u and u, the fact that 
u(^(p{T),T^ = g(^(p(T)^, and that uh is the restriction of n to Hq x [0,T], we 
can write 

u{y^o,to) - u{ipo,to) = UH{y:'{T),T) - UH{y:'{to),to) + / h{a)ds 



to 

T , d _ _ . 

—UH{y:>{s),s) + h{a{s))jds, 



(3.5) 



since uh{^{s), s) is absolutely continuous as u is locally Lipschitz continuous 
(see [3]) and (p is absolutely continuous as a function of s. 

We now use that uh is a semiconcave function (with linear modulus), so 
that for every p G D^U}{{zq) there exists a constant K such that 

uh{z) - uh{zq) - {p, z - zq) < K\z - zq\^ (3.6) 
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for all z in a neie; hborhood of zq £ H^{0,1) x (0,r); see [21. Let now 
p*{s) = (p^(s),Pi(s)) be any element in D+UH{(pis),s) D (^^(0,1) x M). 
Consider a point s where the derivative (^i(s) exists. A lower bound for 
the backward derivative of Miy((^(s),s) will now be obtained. We split the 
difference quotient approximating the backward derivative at s: 

uh{'p{s), s) — uh{(p{s — As), s — As) 
A^ 

= - [uh{^{s - As),s - As) - uh{(p{s),s) 

-p;{s)i-As) - (p;(s),(^(s - As) - ^is))]/As 

If equation (|3.6p is used together with the fact that is differentiable at s it 
can be deduced that the quotient involving the square bracket in the above 
equation is greater than or equal to —K'As, for some constant K' . Letting 
As ^ we see that 

-^uh{Hs),s) >p*tis) + (p*(s),(^t(s)), 

where (temporarily) d/ds denotes the backward derivative. In order to be 
able to apply ()3.2p we note that ipt & V implies 

{Ply,'ft) = {Pp%,(pt)- 

Thus the integrand in (13. 5p . using the backward derivative, can be bounded 
from below as follows: 

^UH[^{s),s)+h{a{s)) >pl{s) + {^t{s),Pp*^{s))+h{a{s)) 

= p:{s)-6{^,{s),{Pp;{s)),)-6-'{v'{^{s)),Pp*{s)) 

+ (a(s),Pp;(s)) + i||a(s)||2 

>p:is)+H{Pp;is),^is)), 

since 

H{X,ip)= min (-6{ip,,\x)-6-\V'{ip),X)+{a,X) + l-\\a\\^). 

The double sided and the backward time derivatives of uh{(p{s), s) differ on 
a set of measure zero, so there is no problem in using the backward derivative 
in (I33D . 

Step 2. Lower bound for u((/9o,to) — ^(<^0)*o)- It is now assumed that 
ifo G V. Similarly as in Step 1 we write, noting that u is only defined on 
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Vx[0,T], 
uivo,to) - u{ipo,to) 

= g{^{T)) - g{P^{T)) +u{P^{T),T) - u{P^{to),to) + C h[a{s))ds 

Jto 

= g{ip{T)) - g{Pip{T)) + (^-u{Pip{s), s) + h{a{s)))ds =: I + 11. 



(3.7) 



A lower bound for part // is obtained by splitting the difference quotient 
approximating the backward derivative at s: 



u[Pip{s), s) — u[Pip{s — As), s — As) 

.s) 

(3.8) 



As 

= - [u{P(p{s - As), s - As) - u{P(p{s), s) 

-pf{s){-As) - {p*{s),P^{s - As) - Pv{s))]/As 

+ Pns) + {ptis),^^^^^^^^ 

The derivative ipt{s) exists for Iq < s < T by the theory in e.g. Chapter 3 
in [18], where we have used also that the control, a = —A, solves an adjoint 
backward parabolic PDE, and therefore is Holder continuous. It is now used 
that ||-Px|| < ||x||, u is semiconcave (see e.g. [5]), and that 



{p*{s) 



Pip{s) - Pip{s -As)\_f ^ (fjs) - f{s - As) 



As ) V^'^^^' As 

in equation (13. 8[) . so that we have, similarly as in Step 1, that 

-^u{Pip{s),s) > pf{s) + (p#(s), (^t(s)). 

By further using Chapter 3 in [18] it is known that equation (jl.4p is satisfied 
in the L sense, with Lp{s) £ H^{0, 1) n H^{0, 1), for to < s <T. Similarly 
as in Step 1, the integrand in (j3.7p . using the backward derivative can be 
bounded from below: 

^n(P(^(s), s) + h{a{s)) > pf{s) + H{p*{s), ^(s)). 

As u is a viscosity solution to the Hamilton-Jacobi equation for the discrete 
value function it holds that 



pf{s)+H{p*{s),P^{s))>0, 



which proves the second inequality in (j3.4p . □ 

Theorem 13.11 will be used when the error between the original and the 
approximate value functions is computed. For this to work some knowledge 
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about the superdifferential D^uh is needed. The dual equation 

-\t = 5Kx - 5-^\V"{^), (3.9a) 
X{T) = 2K{^{T)-ip-), (3.9b) 

is introduced. Let a and (p be optimal pairs as in Theorem 13.11 According 
to Theorem 12.11 it is possible to choose a bounded control. For the mild 
solution A to ()3.9p there exists, according to Theorem 3.1 in [4], a subset 
C C [to)^]i of full measure, such that, for all t £ C, 

ip{t)eH^io,i)nH\o,i) =^ [^\it),-H{X{t),ip{t))) e D+u{<fit),t). 

(3.10) 

By the same theorem, it holds that for almost every t € [ioi^"], 

(A(t),a(t))+M^= min ((A(t),a) + M£), (3.II) 

\\a\\<L 

where L is the bound on the control from Theorem 12.11 This bound is 
included in order to be able to use the aforementioned Theorem 3.1 in [4], 
but since a and A correspond to the original problem (jl.Sp with no bound 
we could also have used any constant greater than L in (j3.1ip . Hence we 
see that (j3.1ip holds also without the requirement that a is bounded. From 
this we draw the conclusion that A(t) = —a{t) a.e. The mild solutions (f 
and A therefore satisfy the system 

ifit) = S{t - to)m + I S{t-s){- rV(v9(s)) - \{s))ds, (3.12a) 

Hto) = ifo, (3.12b) 

X{t) = S{T -t)X{T) - j S{s-t){X{s)V"{ip{s)))ds, (3.12c) 

A(r) = 2K{ip{T) - if-), (3.12d) 

where S{t) is the contraction semigroup of linear operators generated by 
(5^, (13.12ap is equation (jl.Sp with A = —a, and ()3.12cp is the equation for 
mild solutions to (j3.9ap : see e.g. Theorem 3.1 in [4]. Following the notation 
in [18] we introduce 

A ^-6— 

The operator A has eigenvalues = dvr^n^, (n = 1,2,3,...) with corre- 
sponding eigenfunctions ipnix) = \/2 sin(n7rx). Fractional powers of A may 
be defined using this: 

00 

A'y^ = Y,kZ{i^n,v)i^n, (3.13) 

n=l 
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for 7 > 0. The domain for is given by 

oo 

D{A^) = [^eL\0,l): ^kl^i^n,^? <oc]. (3.14) 

n=l 

For 7 = 1 and 7 = 1/2 we have that = and = 

V^ll'/'xll- We state a few useful properties of the fractional powers of A, 
which may be found in e.g. [18]. For any K > and all < 7 < X there 
exists a constant C such that 

\\A^S{t)\\ <Ct-^, fort>0, (3.15a) 
and if < 7 < 1,V9 G I?(^^), 

\\{S{t)-lM<-Ct^\\A'r^\\. (3.15b) 
7 

It also holds that 

A^'A^' = A^'a^' = A^'+^' on D{A^'+^') when j\-f^ > 0, 

(3.15c) 

A^S{t) = S{t)A'^ on D{A^), t > 0. (3.15d) 

In the following Theorem it is shown how an element in D^Uf{{(p{s), s) 
can be obtained, which is needed according to Theorem 13. 1[ 

Theorem 3.2. When ipo £ V and < s <T one element in D^uh{^o, s) 
is given by 

(a(s),-F(A(s),^o)), 

where X is a mild solution to (|3.9p and (p is an optimal solution to (jl.Sp with 
to = s. 

Proof. Using Chapter 3 in [18] we have that (p{t) G H^{0, 1) n ^^(o, i) for 
to <t<T, so by (I3T0]) 

(A(t), -H{X{t),ipit))) G D+u{ip{t),t), a.e. 

It follows from the definition that then also 

(A(t), -H{\{t), ip{t))) G D+UHiipit),t), a.e. 

We shall verify that the semiconcavity of uh implies that 

Zn Zq, D^UuiZn) 3 Pn ^ P =^ P G D"^Uh{zq). (3.16) 

In order to prove this we use (j3.6p at the points Zn- We thereby have that 

uh{z) - uh{zq) - {p,z- Zo) 

=Uh{z) - UH{Zn) - (Pn, Z - Zn) 

+ UniZn) - Uh{zq) + {Pn-P,Z- Zq) + (p„, Z - Zn) 

<K\z- zq\'^ + e, 
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where e can be made arbitrarily small by using the convergence Zn zo, 
Pn — > P, and that uh is continuous (it is even locally Lipschitz continuous, 
see [3]). Hence 

uh{z) - uh{zq) - {p,z- zq) < K\z - zol^, 

which implies that p G D^unizo), and (13.160 holds. (As can be seen in the 
above argument it suffices that Pn ^ P weakly, but we will not need this 
here.) 

Since the Hamiltonian H : Hq{^,\) x Hq{{),1) ^ M is locally Lipschitz 
continuous and p.lOp holds, what remains is to prove that ip and A are 
continuous as functions of time with values in -^9(0, 1) at t^. By equation 
(j3.12ap we have that 

A^l\^{t) - vpo) = {S{t - to) - I)A^'\, 

+ / A^''^S{t-s){-5-^V'{ip{s))-X{s))ds, (3.17) 

where passing A'^/'^ under the integral sign is justified by the fact that A'^/'^ 
is a closed operator. By ()3.14p it is a straightforward calculation to confirm 
that V C D{A'^) for 7 < 3/4. Since lpq G V, (|3.15bp may be used to get a 
bound for the first term in the right hand side of ()3.17p : 

\\{S{t - to) - /)^^/Vo|| < 10Cti/^0||A3/Vo||. 

The norm of the integral in (|3.17p converges to zero as t — > to by (|3.15ap 
and the fact that V' and A (since it equals —a) are bounded. Hence 

||Ai/2((^(t) -(^o)ll ^0, ast\to. 

The function A is also continuous as a function with values in -ffo(0, 1) when 
t \ to, as, by Theorem 3.5.2 in [18], ||^'*'At|| exists when t < T and 7 < 1, 
e.g. 7 = 1/2. □ 

In order to be able to use Theorem 13.11 and 13.21 a few results about the 
regularity for the state and the dual is established. The original setting, 
without discretization in space, is considered first. 

Theorem 3.3. For every C > and all starting positions (930, to) satisfying 
I Iv'ol |_f/i(o 1) Q <tQ<T, there exists a D > and an optimal state (p 

to problem (II. Sp . with corresponding dual X, solving (13.91) . such that for all 
to<t<T, 

II<^WIIh1(o,i) < A (3.18a) 
Mt)\\HHo,i)<D{t-to)-^/\ (3.18b) 
l|A(t)||H,5(o,i) (3.18c) 
m)\\HHo,i)<D{T-to)-'/^ (3.18d) 
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//(/9o satisfies the higher regularity \\A^/'^ipQ\\ < C it further holds that 

Mt)\\H^i^^,i)<D{t-tQ)-^/\ (3.18e) 

\\m\\m{o,i)<D[T-t^r^'\ (3.18f) 

Proof. In the proof, we will write D for any constant which may depend on 
C, but not on t^. 

Step 1. By theorem 12. II it is possible to choose an optimal control a such 
that ||a(t)|| < L, for some constant L which only depends on C. Since 
A = —a the same holds for A. 

Step 2. Since and S{t) commute (see ()3.15dp ) we can operate with 
A^f"^ on equation ()3.12ap to obtain 

A^/VW = 5(t - to)^^/Vo + f A^'"^ S{t - s){- {<^{s)) - \{s))ds. 

As S{t) is a contraction semigroup and by the boundedness of V and | |A(s)| | 
together with (|3.15ap it therefore holds that 

<||Ai/Vo||+^ [\t-s)-^/^ds, 
Jto 

and hence Hv^xC^)!! ^ D. By a Poincare inequality (e.g. Proposition 5.3.5 
in [1]) ([3lHal) holds. 

Step 3. Since A(r) = 2K{(p{T) -(/?"), boundedness of A(r) in H^{0, 1) 
follows. The same analysis for (j3.12cp as was performed in Step 2 may 
therefore be used. Using that ||A(s)|| is bounded for all s gives p.lScp . 

Step 4- Operate with A on (I3.12a|) to obtain 

Aifit) = A^/^S{t-to)A^/^ipo+ f A^/^S{t-s)A^/'^{-5-^V'{ip{s))-\{s))ds. 

(3.19) 

Since (p{s) and A(s) are bounded in Hq{0, 1) for all s it holds that 
\\A^/^-6-W{ip{s))-X{s))\\ < D. 

Therefore 

\\A^{t)\\ < D{t - to)-'/^ + D [\t - s)-^'^ds < D{t - to)-'/', 

Jto 

as T is finite. So (I3.18bp holds. 

Step 5. In this last step we use the operator A in equation (j3.12cp in the 
following way: 

AX{t) = S{T -t)AX{T)-5-^ A^/'^S{s-t)A^/'^[x{s)V"[^{s))^ds. 

(3.20 ) 

The bound for '^{T) in H^{^,\) may be transferred to A(T) by (|3.12dp . 
which yields ||A(r)|| < D{T - to)"'/'- As both ^{t) and \{t) are bounded 
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0.5 0.6 0.7 O.B 0.9 1 



Figure 3.1. Two of the nine basis functions when Ax = 1/10. 



in H^{0, 1) for ah to < * < ^ the integral in ([3:20|l is bounded by 



D 



Since T is finite ()3.18dp holds. 

Step 6. Equations (|3.18ep and ()3.18fp can be proved similarly as in Step 
4 and Step 5 by changing the first term in the right hand side of (|3.19p to 
A'^/'^Sjt - to)A ^/'^cpo. By this we see that < D{t - to)"^/^, which 

implies (|3.18f|) . just as in Step 5. □ 

A regularity result for the spatially discretized case is now to be estab- 
lished. According to theory in e.g. [5] the optimal control problem ()3.ip . 
(|3.2p has a minimizing control a. The corresponding state is denoted 
The value function is differentiable along optimal paths, i.e. the derivative 
exists at u{(p{s),s) for all s G {to,T]. The spatial Gateaux derivative of u 
at ((^(s),s) will be denoted A(s). The optimal state, if, and the Gateaux 
derivative A satisfy the following system: 

{^t,v) = -6i^^,v^)-{6-Wi^) + ~X,v), foiaWveV, (3.21a) 

^{to) = (^0, (3.21b) 

-iXuv) = -6iXx,v,)-6-\~XV"{^),v), forall^;Gl^, (3.21c) 

A(r) = 2K{^{T) - Fiojyip-). (3.21d) 

Furthermore, the theory in [5] reveals that the optimal control, a, satisfies 
a = —A. Some new notation is now introduced. We let the interval [0, 1] be 
divided into M subintervals with Ax = 1/M, and let {v^}ff^i^ be the stan- 
dard nodal basis in V; see Figure 13.11 The interpolant, I, takes a function 
in C([0, 1]) to the element in V which coincides with the original function at 
iAx, i = 1, . . . , M — 1, so that, for instance, /A(iAx) = X{iAx). The second 
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difference quotient matrix and the mass matrix B are introduced as 

f-2 1 ••• \ 

1 



Ax2 



1 -2 1 

■•• ■•• 







: 1-2 1 

Vo ••• 1 -2 y 



(3.22) 



B 



f2/3 1/6 
1/6 2/3 1/6 

••. ••. 



\ 



: 1/6 2/3 1/6 

V ••• 1/6 2/3/ 



If (p{t) and X{t) are written in the basis {v'^}fL^ ^ as 



M-l 



M-1 



i=l 



i=l 



then equations (|3.21ap and (|3.21cp may be rewritten as 



6 



-1 



BC' = 6DX - -^P - B9 



-Be' = 6D'^e 



Ax 
5^ 
Ax ' 



where 



c 



P 



and r 



(3.23) 



(3.24) 

(3.25a) 
(3.25b) 



(3.26) 



We now state a Lemma which will be used in the proofs of Theorem [33] and 



Lemma 3.4. For any element tp G H^{0, 1) H ^^o(0, 1) the projection Pip, 
written in the nodal basis {v^} as 



M-l 



i=l 
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0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 



Figure 3.2. The function v admits values in the interval 
[-2/Ax^,l/Ax^] (here Ax = 1/10). 



satisfies 



M-l 



{AxY,{D^)^f'<CU 



(3.27) 



i=l 



with a constant C independent of Ax. 
Proof. The vector ^ is defined as 



and is given ^ = ^q, where 




which follows from the fact that {ip,v^) = {P^p,v^) for all v^. The matrices 
and commute, since the eigenvectors of and B (and B~^) are 
equal, so D^^ = -^B~^D^q. Every element of the vector D^q, except the 
first and the last, is a L^(0, 1) scalar product between ip and a translate 
of the function v in Figure 13. 2[ It is easy to check that there exists a 
primitive function w to f such that v{0) = v{4Ax) = 0, and that there exists 
a primitive function v to v such that v{0) = v{4Ax) = 0. Furthermore 
maxv < E, for a constant E which does not depend on Ax. Hence, 

\{i^,v)\ = \{i;xx,v)\ < E / \^l^,,\dx < 2EVA^{ / i^l^dx^^^ 

Jo Jo 

The same sort of bound may be obtained also for the first and the last 
elements of D^q by using a 2-periodic, odd extension of tp outside [0, 1]. We 
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therefore have that 

j-Ax f-3Ax MAx 

\\D'^q\\l < 4:E'^Ax{ tpl^dx+ I ip^^dx + tpl^dx + ... 

Jo Jo Jo 

^ V ' ^ V ' 

Prom (D2g)i From (D2g)2 

+ / 4'lxdx+ / ^Plxdx) 

J{M-3)Ax J{M-l)Ax 

^ V ' 

Prom iD^q)M-i 

where || • II2 denotes the standard Euchdean vector norm. The eigenvalues 
of lie in the interval [1, 3], and hence it holds that 

1 9 F 

V Ax 

which is equivalent to (|3.27p . □ 

Theorem 3.5. There are constants E and F, depending on the parameters 
of the optimal control problem, as in Theorem \2.1{ hut not on ipo, £ [Oj^] 
and the size of the spatial discretization, such that for all to < t < T , 

||(^,.(t)|| + ||A,.(t)|| < E\\{ipoU\ + F, 

M-l 

\\Mt)\\<EiAxY,iD\{to)f,f' + F, 



and 



M-l 



\Mt)\\ < {E(Axy(D'Cito))lf' + F 



=1 



where 



M-l 



1=1 

Proof. The proof uses the same kind of techniques as in the proof of Theorem 
5 on page 360 in [16]. Here, however, the regularity of (p must also be 
conveyed to A. In the proof, we will write E and F for any constants that 
may depend on the parameters of the problem, but not on Ax, to, and (pQ. 

Step 1. As in the infinite dimensional case, treated in Theorem 12.11 it 
holds that the infimum in (13. ip can be changed to inf||Q||^^^^ j..v^<e\\^o\\+f- 
The proof goes just as the proof for Theorem 12.11 By a Poincare inequality 
(see e.g. Theorem 5.3.5 in [1]) it holds that W^poW < C'll(vo)x|| and there- 
fore the infimum can be written i'^i\\a\\^^f^^^^.y^<E\\{Lpo)x\\+F- It is therefore 



possible to let f = 93 in (|3.2p and use this boundedness of a to see that 
\\^{t)\ \ < E\\i^o)^\\+F, for all to<t<T. 
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Step 2. As already noted A = — a, and so by Step 1 the same bound on 
||A(t)|| also holds. 

Step 3. With v = (pt iti (13.21ap we have 

(5 d 

^ X-1|T/'I II 7^ II _L II \ll II ^ II ^ ;:-2|T^/|2 I ll'^tlP , PIP I \Wt\? 

<o \V \ ■ \\ipt\\ + \\^\ ■ \m\\ < \V\ H — \ — —. 

The boundedness of {(po)x and ||A(t)|| thus implies that 

<^||((^o)x|| 
\Wt\\L^{tQ,T;V) < E\\{'Po)x\\ +F. 

By this the same sort of bound holds for Xx{T). Letting u = At in (|3.21cP 
gives that 

||A,(t)|| <E\\{^q),\\ + F, 

Pt||L2(io,T;y) < E\\{ipQ)x\\+F, 

similarly as for (p. 

Step 4- All eigenvalues of B lie in the interval [1/3, 1], and so ||i?~^||2 < 3 
(independently of Ax), where || • II2 denotes the operator 2-norm. Prom this 
and (j3.25p it follows that 



M-l 



{to)\\<E{l^xY,{D\{to)1fY" + F. 



1=1 



We now introduce the notation ip = ipt and X = Xt and differentiate equation 
()3.21ap with respect to time: 

{ipt,v) = -5{ipx,Vx) - 5'^{V"{(p)(p,v) - {X,v), for ah v £ V. 

Let V = (f to obtain 

<r'in-IMI= + ^ + ^. 

I I iiY-ii 2 2 
The fact that ||'^t(to)|| ^ D together with the result on boundedness of 
\\M\L^to,T;V) and ||<^||L2{to,T;V) ^ Step 3 implies that 

mt)\\ < E{Ax {D'C{to)f,f' + F, for to<t<T. 
1=1 

Step 5. In this step it will be shown that \ [{P^-)xt'Wx) \ ^ for all 

w € V. Some new notation is introduced: 

A/-1 M-l 
1=1 i=l 
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with corresponding vectors and r]. By means of a partial integration 

\{{Pip.Uw^)\ < Ax\r]^D^^\ < AxM\2 ■ \\DW\2, (3.28) 

with II • II2 denoting the Euchdean vector norm. Since the eigenvalues of B 
lie in [1/3, 1] it holds that 



A 

T — 

and so by Lemma 13.41 (j3.28p gives: 

|((P^_).,u;.)| < AxWD'CW, . Ml, < Ax:^%M • ^IHI = D\\w\\. 

y Ax v Ax 

Step 6. It holds by equation (I3.21dl) that X^{T) = 2K{<^^(T) - {Pip-)x)- 
Using this in (j3.21cp as well as (I3.21ap gives 

-(A(r), = -2K5{^,{T),v^) + 2K6{{P^.\,v^) - 6-\\{T)V" {^{T)),v) 
= 2K{^{T),v) + 2K{6~^V\^{T)) + \{T),v) 
+ 2K5{{P^.Uv^) - 6-\MT)V"{^{T)),v). 
With v = X(T) it follows that 

M-l 

\\X{T)\\<E{Ax^{D'C{to))lf' + F, 
1=1 

by the results in Step 4 and Step 5. In order to bound A at all times, 
equation (I3.21cp is differentiated with respect to time: 

-{Xt,v) = -6{X.,v.) - 6-\XV"{ip),v) - 6'\XV"'{^)^,v). 

With i; = A in the previous equation the following bound is obtained: 

-^|||A|P < 6-'\V"\ • IIAIP + 6-'\V"'\ f \XfX\dx. 

J 

Since A is bounded by -E(Ax ^f£~^(L>2((to))2)i/2 ^ ^ ^1 ^^^^^ 
it is similarly bounded in The last integral in the previous inequality 
may therefore be estimated as follows: 

1 M-\ 

\X(pX\dx < \X\ ■ M\ ■ \\X\\ < [e{Ax {D'C{to))lf' + F 

i=l 

Using J^||A|p = 2||A||J^||A||, Gronwall's Lemma, and the boundedness of 

||A(r)|| we see that ||A(t)|| is bounded by (^(Ax ^^i^^(Z)2((to))f)^^VF)^ 
for all times. □ 

With the error representation in Theorem 13. 11 Theorem 13 . 2 1 ab out D~^uh, 
and the regularity results of Theorem 13.31 and 13.51 it is possible to prove 
Theorem 13.61 about spatial convergence. We first give the idea of the proof. 
When the first integral in (13. 4p is estimated the optimal path ip{s) is used. In 
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time 




state 



Figure 3.3. Sketch showing the idea in Theorem 13. 6[ 

order to obtain an element in D~^uh{'p{s), s) the system of equations ()3.12p 
is considered with ((^(s),s) playing the role of {(po,to). By Theorem 13.21 
the computed A(s) is the spatial part of an element in D^uh[(p{s), s) . 
Therefore, equation (j3.12p must be used for every starting position (<^(s), s) , 
< s < T, as depicted by Figure 13. 3i Similarly, when the second integral 
in (j3.4p is estimated, the optimal path Lp{s) is used, and for each (P(/3(s), s) 
as starting positions the solution A(s) to (|3.2ip is computed. 

Theorem 3.6. For every constant C > and all starting points with 

M-l 

1=1 

such that 

M-l 

{AxY,iD'Olf^'<C, (3.30) 
1=1 

there is a constant D > such that 

\u{ipo, 0) - u{ipo, 0)1 < DAx^. (3.31) 

Remark 3.7. Every reasonable approximation in V of for any Ax, 

satisfies (j3.30p . The interpolant and the projection are possible choices. 

Proof. As in the proof of Theorem 13.51 whenever D is written in this proof 
it means a constant independent of Ax, but (possibly) dependent on C. 
Step 1. It will be shown that condition (j3.30p implies 

||^^/Voll<^- (3.32) 

Using (j3.13p it follows that 

oo 

\\A^^\\ = {Y,kl^{i^n,vff'^ (3.33) 

n=l 
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where kn and ipn are the eigenvalues and eigenfunctions of A. Two partial 
integrations imply that 

where {^o)xx is the distributional second derivative of ipo, i.e. 

A/-1 



i=l 

with 6iA.x the Dirac delta distribution in x = iAx. Since jV'nl^;)! ^ \/2 for 
all n G N and ah x G [0, 1] it holds that 

/rj ^ M-l 

i=l i=l 



i=l 



It thereby follows that the sum in ()3.33p is finite (and proportional to C) 
when 7 < 3/4, e.g. 7 = 5/7. 

Step 2. In this step it is shown that there exists a solution A to ()3.12p 
with (p{s) playing the role of (fQ, such that 

\H{PX{s),^{s)) - H{X{s), <f{s)) I < DAx^{T - s)'^/^. (3.34) 

We start by showing that the starting position (pQ in ()3.29p is bounded in 
i/i(0,l): 

||('/'o)x|P = -{{^o)xx,^o) 

M-l 

< D{Ax {D^Olf^\\Vo\\Lo^io,i) < DWiM, 

i=l 

where the last inequality follows by a Sobolev inequality. Hence 
||((^o)a;|| ^ D. By Theorem 13.51 it follows that ||(^i,.(s)|| < D, for all to ^ 
s<T. By Theorem ES] it then follows that ||A(s)||h2(o,i) < D{T - s)'^/"^. 
The Hamiltonian, H, consists of three parts; see (13. 3p . The first of these is 
the most difficult when ()3.34p is to be proved, so we will focus on this one 
and let the other two parts be treated by the reader. The difference between 
the first parts of the Hamiltonians in ()3.34p is given by 

^,{s),{\{s)-P\{s))^ = {^Us),Ks)-P\{s)), 

where the factor 5 is left out for convenience. We reuse notation and let 

A/-1 
i=l 



{Ax^iD^Olf'<D. 
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SO that 

i=l 

As ||(^t(s)|| < D by Theorem 13.51 it. again using (j3.25p . holds that 

M-l 
i=l 

Hence it holds that 

M-l 

\{^xxis),Xis)-PX{s))\<Ax \{D^Oi{KiAx) - PX{iAx))\ 

i=l 

M-l M-l 

< (Ax j;p2^)2)i/2(Ax ^ (A(iAx)-PA(iAx))') 

i=l i=l 
M-l 

< d(^Ax {X{iAx) - PX{iAx)f^ . (3.35) 

i=l 

By the use of the interpolant the last parenthesis in (j3.35p may be written 

M-l 

(^Ax Y {iXiiAx) - PX{iAx)f^ . 
1=1 

This minor difference simplifies the situation as IX{s) — PX{s) £ V, which 
makes comparison with the norm possible. For an element 

M-l 

!<■= Y ^ ^ 

i=l 

it holds that 

= Ax{ri,Br])2, 

where (•, •)2 is the Euclidean scalar product on M*^. All eigenvalues of B lie 



in the interval [1/3, 1], and hence 



M-l 



IX - PA|p > -Ax Y (A(iAx) - PX{iAx)f. 



i=l 

It also holds that 

||/A - PA|| = ||P(/A - A)|| < ||/A - All < Z)Ax2||A||^2(o,i), 

where the last inequality may be found in e.g. [1]. By equation ()3.18dp we 
have that 

\\X{s)\\h2^o,i) < D{T - sr'/\ 
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and so (I3.34p holds. By Theorem 13.21 an element in D~^UH{y^{s), s) is given 
by ^A(s), — (A(s), (^(s))^ , and thereby it is clear that the first integral in 



()3.4p may be bounded by 

r-T 



DAx'^ [ (T - sy^/^ds < DAx"^. 

Jto 



Step 3. In this step a bound for the second integral in ()3.4p is derived. In 
Step 1 it was proved that ||A^/^(/7o|| < D. Theorem 13.31 then implies that 
||(^(s)||j|/2(o,i) < D{s — to)~'^/'^ . Therefore, by Lemma [3^ and Theorem 13.51 
there exists a solution A to ()3.2ip . with in the role of {ipo,to), 

such that 

\\Ms)\\ < {D{s - to)-'/' + < D{s - to)-^". 
By (|3.25bp it holds that 

M-l 

[AxY,{DH{s)f^ <D{s-to)-''\ 

^ i=l 

where 9 is given by (|3.24p and (j3.26p . In order to be able to use the above 
information to get a bound of the second integral in (j3.4p . we need that 
A(s) is the spatial part of an element in D~^u{Pip{s), s). This follows from 
Lemma 3.3.16 and Theorem 7.4.17 in [5]. As in Step 2 we are satisfied with 
considering only the first parts of the Hamiltonians. The only difference is 
that now the partial integration is performed so that A(s) is distributionally 
differentiated twice: 

A,(s), {^{s) - P^is))^) I = I CX^s), ^{s) - P^{s)) I 
< (Ax {D^e{s))^) (Ax Y s) - Pip{iAx, s)) 



=1 



<D{s-to)-^/''Ax^\\ip^,{s)\\ <DAx\s-tof/\ 
similarly as in Step 2. The second integral in (j3.4p is therefore bounded by 



a term 



DAx^ f {s- toY^'^ds = DAx^. 



'to 

Step 4- It remains to show that the difference g{P(p{T)) — g(ip{T)) is of 
the order Ax^. Since ||(/3(r)|| is uniformly bounded for all starting positions 
in a bounded set in L^(0, 1) we see by (12. 2p that the difference in final costs 
is less than D\\Pip{T) — ip{T)\\. Since PIip = lip, where / is the interpolant, 
introduced in Step2, we have that 

\\Pip{T) - ip{T)\\ < \\P{ip{t) - MT))\\ + IIMT) - ip{T)\\ 

< 2\\Iip{T) - ip{T)\\ < DAx^\\ip^,{T)\\ < DAx\ 
where the last inequality follows by Theorem 13.31 □ 
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The next theorem provides an error estimate which makes comparison 
with the situation where 99+ is used as initial position possible. 

Theorem 3.8. There exists a constant D > such that 

|u((/7+, 0) - n(P(/9+, 0)1 + 0) - u{Iip+,0)\ < DAx^. (3.36) 

Remark 3.9. The theorem shows that both the projection and the inter- 
polant can he chosen when approximating in V . 

Proof. The semiconcavity of uh implies that for every bounded set X C 
-ffo(0, 1) there exists a constant D, such that 

(3.37) 

where p is the spatial part of any element in D^uh{^'^ ,0) (compare (I3.6p ). 

In Theorem [372] it was proved that (^A(O), -H{X{0) , 99"^)^ is one such element, 

where A is a solution to (|3.12p with (/jq = V^- We may therefore take p = A(0) 
in (|3.37p . By Theorem l3.31 \ \p\ \ < D for some constant D (it is even bounded 
in Hq{0,1), but this is not needed here). Plugging this boundedness into 
(j3.37p results in the inequality 

UHiip\0) - UHiip^O) < D{\\^' - ip^W + \\ip^- V^'l 1^1(0,1))- 

We may change places for and 99^ everywhere above, and thereby obtain 

Consider now c/?^ = 99 -|_ and = /(/?_(_ or ip'^ = Pif+- For the interpolant, /, 
the following bounds hold: 

(3.39) 

-If+\\H^{0,1) < D\\{f+)xx\\^X, 

with a constant D independent of Ax and (p^. Since {(p^)xx is bounded in 
L°°(0, 1) this together with ()3.38p directly shows that the interpolant part of 
()3.36p is correct. The projection part is proved by using the result from [14], 
that the projection is stable in Hq{0, 1), i.e. 

1 1 1^1 (0,1) < ^11^11^1(0,1)- 

It therefore holds that 

- -P(/?+||h1(0,1) ^ 11^+ - ^'^+\\h^{0,1) + \\^'^+ - ^'/'+ 1 1^1(0,1) 
= \\V^+-If+\\Hl,{0,l) + \\PiIf+-f+)\\Hl,{0,l) < (l + ^)IIV+--^¥'+ll/fi(0,l)- 

(3.40) 

The same technique as in (I3.40p can also be used for the norm, now using 
the obvious bound ||-P'/'|| < WfW, which implies that 

\\^+-Pip+\\<2\\^+-I^+\\. (3.41) 
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The equations ()3.38p . (13.390 . ()3.40p and (I3.4ip imply that also the projection 
part of (|3.36p is correct. □ 

Theorems 13.61 and 13.81 directly imply the following corollary. 
Corollary 3.10. There exists a constant D, such that 

|m((^+, 0) - n(P99+, 0)1 + |n(v3+, 0) - u{Iip+,Q)\ < DAx^. 

4. Discretization in time 

In [20] the method Symplectic Pontryagin for approximation of optimally 
controlled ODE:s is constructed and analyzed. It is a Symplectic Euler 
discretization for a Hamiltonian system, involving the state and dual vari- 
ables associated with the control problem, with a regularized Hamiltonian. 
In the present situation, when the Hamiltonian is smooth, the Symplectic 
Pontryagin method reduces to ordinary Symplectic Euler, since no need for 
regularization exists. The theory in [20] can be used to show that the differ- 
ence between the value function for a system with only spatial discretization 
and the value function for a system with discretization in space and time, 
is of the order At, where At is the size of the temporal discretization. It is, 
however, desirable to achieve more than this. In order for the estimate on 
the temporal discretization to be useful the constant in front of At in the 
error estimate needs to be really constant, i.e. independent of Ax. 

The theorems in [20] do not directly provide the desired result. This has 
to do with the fact that the second order difference quotient operator D^, 
defined in (j3.22p . has norm proportional to 1/Ax^. Furthermore, the proof 
in [20] requires a bound on the derivatives dX^'^^ / d(p^ , where (p and A are 
obtained with the Symplectic Pontryagin method. The problem of large 
norm of can be handled using that it is a negative operator. But in 
addition to this we also need to bound dX^^^ /dip"^ independently of Ax in 
some proper sense. 

The proof of convergence of the Symplectic Euler method given here is 
based on another technique. It uses that the present problem admits optimal 
controls which are regular by Theorem 13.51 It also involves an assumption 
about the derivative (px, and another similar assumption. Under these as- 
sumptions it is shown in Theorem 14.11 that a minimum of a forward Euler 
approximation of control problem (13. ip . (13. 2p has an error CAt in the ob- 
jective, where C does not depend on Ax. In Theorem 14.51 it is shown that 
the solution to this minimization problem is equivalent to the solution of a 
Symplectic Euler scheme, and hence the desired property for the Symplec- 
tic Euler scheme is achieved. The main difference in the result when the 
present method is used compared to a result using the theory in [20] is that 
the present result needs an assumption on the derivative (fx whereas [20] 
needs control over d\^~^^ / dip^ . The assumptions on (px seem easier to ver- 
ify. The numerical tests performed in Section [5] support that it is true. 
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We now present the setting of the aforementioned discretized optimization 
problem. Consider the time-discrete state which is a forward Euler 

approximation of the state (f in (13. 2[) and is given by 

= ((^",t;) + At(-5((^S,?;^) + (-rV(^")+a",?;)), for all v £ V, 

(4.1) 

where {a'^}^^^ is a time-discrete control. The discrete state therefore 
corresponds to <p{tn), where t„ = n-^ = nAt. By (14. ip it is possible to define 
a discrete value function for all times tm- 

N-l 

u{ipo, tm) = min {g{^N) + Y] ^(a")) , (4-2) 

t" tn = m n=m 

where {(^"} solves (j4.ip and = (^o- For the proof of Theorem l4.1l we also 
introduce the discrete state {ip"'}^^Q. It is also given by a forward Euler 
time stepping scheme, but its evolution is determined by an optimal control 
a to the time-continuous problem (j3.2p : 

((^"+\r;) = {^^'',v)+At{-6{^'^,v^)+{-6~'V'{^^)+a{tn),v)), for all v e V. 

(4.3) 

We will consider starting positions <po in finite element spaces V satisfying 

5{{^o)x,v.)+6~\V'{^o),v) = 0, forallt;Gl^. (4.4) 

We are now ready for the theorem on time discretization convergence. 

Theorem 4.1. Assume there exists a function r : such that for all 

At < r{Ax) there are solutions {(^"j^^g ^^'^ {'P"'}n=o ""^^^^ (fp = ifp = (pQ^ 
where (po satisfies (14.41) . and 

m+'-V>2\\ + M+'-^2\\<CAt (4.5) 

for all < n < N, where C does not depend on Ax. Then 

\u{ipo,0) -u{^o,0)\ < DAt 

for At < r{Ax), where D does not depend on Ax. 

Remark 4.2. By the numerical computations performed in Section it 
seems plausible that (j4.5p holds. 

Remark 4.3. The proof would be valid without inclusion of the function 
r. However, since the forward Euler method is used it seems reasonable to 
believe that (|4.5p would not be valid for all At. 

Proof. As for Theorem 13 . 1 1 the proof is divided into two steps. We obtain in 
the first step a lower bound for u{(fo,0) — u{ipo,0), and in the second step 
a corresponding upper bound. The first step in this proof is similar to the 
first step in the proof of Theorem 13. 11 while the corresponding second steps 
differ. We denote an optimal pair (control and state) for uhy a and ip, and 
an optimal pair for u by {a""} and {(^"}. 
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Step 1. This part of the proof starts by an extension of the initially 
time-discrete state {(^"} to a piecewise linear time-continuous function : 
[0, T] — > y as follows: 

^{t) = + -^^"P , for tn < t < tn+l- 

As in the proof of Theorem 13.11 we have 

rT ^ N-l 
u{<f 0, 0) -u{tf 0,0) = —u{<f{s),s)ds + Aty^ h{a'). (4.6) 
Jo ds ^ 

In order to be able to use that u solves a Hamilton-Jacobi equation we note 
that the right hand side in (14.60 may be written 

^'+1 , d 



1=0 ^ 



and thus we may focus our attention on one time interval We also 

note that equation (j3.2p defines a flow f : V x V ^ V which is defined by 

{f{^,a),v) = -6{ip,,v,) + {d-^V'{ip) + a,v), for all G K (4.7) 

Let now p{s) = (^p^{s),pt{s)) be any element in D^u[(f{s),s). Similarly as 
in the proof of Theorem 13.11 we have for almost every s E [tn,tn+i] 

= Pt{s) + (p^is), ms), d")) + /i(d") +{p^{s),m\ d") - fms),an) 

' V ' 

>H{p^{s),<^is)) 

> (p^(5),/(<^",d")-/(^(s),d")), 

(4.8) 

since ptis) + H{p^{s), (fi{s)) > as u is a Hamilton-Jacobi viscosity solution. 
By assumption (j4.5p it follows that ^pxis) is bounded for < s < T inde- 
pendently of Ax. We are therefore free to use (^(s) as (po in Theorem 13.51 so 
that the X{s) (corresponding to u{ip{s),s)) is bounded in Hq independently 
of Ax. For such a A(s) we have 

|(A(s),/((^",d")-/((^(s),d"))| 

< 5\ {K{s), - ^x{s)) I + r 1 1 (V'ir) - V'ms)), ~X{s)) I < CAt, 

with C independent of Ax by (j4.5p . It is now used that A(s) is the spatial 
part of an element in D~^u{(p(s), s). It thereby holds that the right hand 
side in (j4.8p is less than CAt in magnitude. 
Step 2. We start by noting that 

?2((^o, 0) - n((^o, 0) > ff(<^(r)) + / h[a)dt - {g{^^) + At V h{a(ti))) . 

(4.9) 
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The difference between the running costs in (14. 9|) is 



N-l 

' [h{a{t))-h{a{tn)))dt. 



i=0 
2 



Using that h{a) = \ \a\\ /2 we have that 

\h{a{t)) - h{a{tn))\ = \\{oi{t) + a(t„), a(t) - a(t„))| 



< ]^\\a{t) + a{tn)\\ ■ \\a{t) - a{tn)\\ < CAt, 



where we have used the result in Theorem 13.51 on the boundedness of the 
control and its derivative (remember that a = —X). It remains to show that 
the difference between the terminal costs in (|4.9p behaves similarly. As in 
Step 1 we now extend the discrete state {(^s"} to a continuous function: 

+ fort„<t<Wi. 

For tn < t < tn+i the evolution equations for (f and (p look as follows: 

{'ft,v) = -5{(px,Vx) + (-5" + a, v), 

for all w € y. Subtract these two equations and let v = ip — if to get: 

1 II - .||2 

= -6i^, - (pl, (px - + 5-\V'{p^) - V'{^),p - ^) 
+ (q - a{tn),p- if) 

+ - V'{ip),^ - (^) + (a - a(t„), - ^) 

\ \ "n ° I |2 I I - ■> 1 12 

<-d\\ipx-fx\\ +0 hO 

c- ii-rw/i II or, o o II II oil — a(in)lP llfp — '/^'IP 



+ — ^ — Il^-V'll + — ^ — • 

According to a Poincare inequality (see e.g. Theorem 5.3.5 in [1]), using 
the Dirichlet conditions, we have that \\ip^ — < — If now 

Gronwall's Lemma is used together with the fact that ||q; — q;"|| + — 
'PxW ^ CAi, we have that ||(^(T) — (/5(T)|| < At. Since g{(p{T)) is bounded 
independently of Ax we have, similarly as in the proof of Theorem l2.1l that 
\g{^{T))-g{p{T))\<CM. □ 
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As convergence of the forward Euler method has now been proved, the 
Symplectic Euler method, which can be used to find the forward Euler solu- 
tion, is now presented. It is given by the system 

= {r,v) + At{-6{r.,v,)-6'\V'{r),v) - (A"+\^;)), 





= V>Oi 






= (A'^+i 






= (A"+i 


v)+At{-6{X'i-^\v^)-5 






) = 2K((^^-P(^_), 



(4.10) 

where g' is a Gateaux derivative and H\{-;v), H^{-]v) are Gateaux deriva- 
tives in the direction v. For every minimizer in (|4.2p there exists a 
solution to (j4.10p with A""^^ = —a"' for all n. In order to prove this we first 
state a lemma. 



Lemma 4.4. The value function u{-,tn) is semiconcave for every n. 

Proof. Consider the starting positions (f^, ai^d '^^^'^^ at time 0. The 
time-discrete cost functional v is introduced: 

Af-l 

^^o,t™ ({«"}) = iai^N) + At J] h{a^)), 

n=m 

where {(^"j solves (14. ip and = ^^o- Let {a"} be an optimal control for 
the starting position ( ^„ ^ ,0)- We can thus write 



u(<^;,0)+n((^o,0)-2n(^i±^,0) 



< 



^^o,o({a"}) + ^^o,o({a"}) - ({a"}). 

o 1" 



iV . 



The states starting in ip\, and ^il^^ all using the control {a"}, are 
called 

M-l M-l M-l 

^i^Y. ^2 = E and (p^.^Y. ' 

1=1 i=l 1=1 
Introducing the notation 

! I ) Pm 



\^m,M-l/ 

where m can be 1, 2, or 3 and 





( ^ 






1 , a" = 





M-l 



i=l 
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we can, using the mass matrix B in ()3.23p and the second difference operator 
in (13.22p . write the equation for t^J^, m = 1, 2, 3, as follows: 

BC^' = BC + At{6D'C - ^-^Pl + Ba-). (4.11) 
Introducing the state z'"' = Ci + ^2 ~ ^^3 using ()4.1ip gives 

= Bz" + ^t{5D'^z'' - +P2- 2P3))- (4.12) 

Every element in the vector 

/ {v'{ri) + v'm-'^v'{<^^,)y) 

\{V'{^^) + V'{^^) - 2y'((^^), t;^^-!)^ 
can be bounded in magnitude by 

\\v'{ri) + v'{r2)-'^v'{^-,)\\-m 



'^-^x\\V'{^-,) + V'{r2)-2V'm 



^Ax||n^?) + F'(^^)-2F'(^^^ 



using the triangle inequality as in ()2.7p . We first treat term / above: 
\\V'{^-,) + V'{^-,) - 2y'(^I±^)|p < ^ - ^^2\'dx 

< ^^(max|^? - ^-2\) = 4^(max|er - ^2"!) 

IT//// 1 2 M-1 IT///' 1 2 

1=1 

where || • II2 denotes the Euclidean vector norm. Part II may be bounded 
as follows: 

2||r(^^^) - V'i^m < AM + ^2- 2^^|| < B\\z^\\2. 
These facts in (j4.12p give that 

||^"+^||2<C||z"||2+Z?||er-C2ll2- (4.13) 

By subtracting the equations for m = 1 and m = 2 in ()4.1ip we see that 

Iier'-e2'''ii2<i?iier-e2"ii2<...<i?"+'iie?-e2°ii2, 

so that in KW we could really write - ^"lli instead of D\\(,'^ - 

Thereby, since z° = 0, it holds that Ik^lb < ^ll^i - ^"lli- Note that the 
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constants A ~ F are allowed to depend on Ax. Similarly as in the proof of 
Theorem 12.31 semiconcavity of ti is a consequence of this. □ 

We are now ready for the promised theorem about the Symplectic Euler 
method. 

Theorem 4.5. For every minimizer {«"} in (j4.2p there exists a solution to 
with A"+i = -a" forO<n<N-l. 

Proof. The proof is divided into three steps. In the first step it is shown 
that the value function u is differentiable along the optimal path (^". In 
the second step it is proved that the dual variable A" equals the Gateaux 
derivative of u, and in the last step it is shown that A""^^ = d". 

Step 1. In order to show that the discrete value function u is differentiable 
at ((^""j t„) for < n < the function 

r(d) = u{(p, tn+i) + Ath{a) (4.14) 

is introduced, where (p = (p'^ -\- Atf{(f'^, a) and / is given by (14. 7p . Assume 
that u is not differentiable at {ip'^~^^ , tn+i ) . Because u is semiconcave it then 
follows that the superdifferential u{(p^^^ , tn+i) (which we let designate 
the superdifferentials in the Gateaux sense) contains more than one point. 
For all d in a neighborhood of d" it holds that 

r(d) - r(d") 

='u((^" + At/((^", d)) - n((^" + At/((^", d")) + At{h{a) - /i(d")) 



<At(p, /(<^", d) - f{ip^, d")) + At{h'{d^),a - d") + 



I 7>'||~ ~n||2 



(4.15) 



=At{p, d - d") + At(/i'(d"), d - d") +K\\a- d"||2, 

where p is an element in u{(p^^^ , tn+i) and p is given by a linear bijection 
of s, since / is linear in the a variable. Since there are more than one element 
p G u{(p'^~^^ , tn+i) , there are also more than one possible p in equation 
(j4.15p . It is therefore possible to choose the element p such that the linear 
term in (j4.15p is non- vanishing. It follows that there exists d such that 
r(d) < r(d"), which is the sought contradiction. By this reasoning we see 
that u is differentiable at [(p^,tn) for < n < A^. 

Step 2. It follows directly that A^ = g'{(p^), i.e. the Gateaux derivative 
of u{-,tN). Assume that A""''"'^ = 'S^((^"^"'^, t^+i). It will follow from this 
that A" = u^{(f'',t n). Since it is known that u is differentiable at both 
{(p^^tn) and {(p"^^^ ,tn+i) the Gateaux derivative of u at (c^"'"''^, t„+i) equals 
the Gateaux derivative at (p^ of the function 

s{^) = + At/((^, d"), t„) + Ath{a^), 

where d" is fixed. The Gateaux derivative of s at (p^ is given by 

5'((^") = u^{r^\tn+i) o (/ + At/'((^")), 

where Uip{(p^^^ ,tn+i) = A""*"^ is a function from F to M and /'((^") is a 
function from V to V . This equation coincides with the A equation in 
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(j4.10p . which gives that A" = By induction in n it follows that 

A" = tn) for < n < iV. 

Step 3. Knowing that u is differentiable at for < n < 

the function ()4.14p can be differentiated. Since a" is a minimizer of r the 
derivative at this argument must be zero: 

r'(a") = A"+^At o / + Ate" = 0, 

where it is used that = A"+^, fa = I and h'{a"') = d". It 

follows that A'"+^ = -d" for < n < iV - 1. □ 

5. Numerical Results 

We here present some numerical results for the Symplectic Euler scheme 
for a finite difference discretization of (jl.4p . (jl.Sp . The numerics is performed 
in this setting, partly because it is slightly simpler than using finite elements, 
partly because a finite difference discretization is used in [15]. The system 
we will consider is therefore 

^ ^ At (5Z)2r,"+i - 6-\'^+^V'{C)) , (5.1) 

where is a finite difference approximation to y?-, is defined in ()3.22p 
and and rj" correspond to the nodal values of and A", respectively. 
The approximate value used together with this scheme is 

n=N 

KAx\ + At Ax I li/2, (5.2) 

n=l 

where || • ||2 denotes the ordinary Euclidean vector norm. As noted in [15] 
there are several local minima to (15. 2p . corresponding to different "strate- 
gies" to overcome the potential barrier V. The switching between the two 
stable points proceeds by "nucleation" , which involves a large control a, fol- 
lowed by propagation of domain walls. In Figure [5T] the transition is shown 
for the cases propagation of one and two domain walls. The A variable, 
which equals the negative control, is shown in Figure 15.21 for the case of 
propagation of two walls. 

Apart from the Symplectic Forward Euler method previously mentioned, 
the Symplectic Backward Euler method can also be used. This method is 
given by 

^n+i ^^n^ At{5D'^C^^ - 5-^V'{C^^) - r/"), 
= ry"+i + At{5D'^rf - 5-\'^V' {C^^)), 
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Figure 5.1. Snapshots of transitions between the two stable 
configurations where is shown at times 0, 0.2, 0.4, 0.6, 0.8 
and 1 (= T). To the right propagation of one wall and to 
the left propagation of two walls. In these examples K = 10^ 
and 5 = 0.06 was used. 




Figure 5.2. The dual variable A for the case of two propa- 
gating walls corresponding to the left part of Figure 15. 1[ 
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Figure 5.3. Convergence of the optimal values ()5.2p and 
(j5.3|) for the case of equal spacing in space and time, i.e. 
Ax = At. The left figure shows the values obtained by the 
Forward Euler method and the right shows the values of the 
Backward Euler method. 



The approximate value for the Symplectic Backward Euler method is given 
by (see Chapter 4.4 in [20]) 

n=N-l 

KAxWC"" -rWl + ^tAx ^ (5.3) 

ra=0 

An advantage with the Backward Euler method is that it enables using a 
small Ax even when At is not small. This feature is however not as profound 
for the present case of control of a parabolic equation as for the uncontrolled 
case, as the control compensates for the instability, which makes it possible 
to use smaller Ax. Another good thing about the Backward Euler method 
is that it seems to underestimate the optimal value while it seems to be over- 
estimated by the Forward Euler method. Figure 15.31 shows the dependence 
on Ax = At of the values (j5.2p and (j5.3p . By extrapolating these fairly 
straight curves to Ax = At = an approximate value of the optimal control 
problem is obtained. The extrapolated value from the Forward Euler curve 
is 8.517, and the approximate value from the Backward Euler curve is 8.526. 

We now indicate the dependence of the spatial discretization error on Ax. 
This is done by changing the spatial discretization Ax while keeping the time 
discretization At constant. We let the value obtained for the smallest spatial 
discretization Ax be the reference value which takes the role of an "exact" 
solution. A convergence plot can be found in Figure 15.41 The slope of the 
upper part of this curve corresponds to a convergence rate of approximately 

(Ax)2-37. 

For the time discretization error we want to show that it is less than a 
linear function of At with a constant which does not depend on Ax. Time 
discretization convergence is therefore considered for two spatial discretiza- 
tions, one having Ax = 1/30 and the other Ax = 1/100. Since the Forward 
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Figure 5.4. Convergence of the optimal value (y-axis) with 
respect to Ax (x-axis). The case with two propagating do- 
main walls and At = 1/200, 6 = 0.03 and K = 10^. 



and Backward Euler methods in the limit At ^ shall have the same value 
we may extrapolate the values from diagrams similar to the ones in Figure 
15.31 but with the exception that Ax is held fixed. The following values are 
obtained from these extrapolations in the case of two propagating domain 
walls, using 6 = 0.03 and K = 10^: 

• Forward Euler, Ax = 1/30: 8.841 

• Forward Euler, Ax = 1/100: 8.547 

• Backward Euler, Ax = 1/30: 8.849 

• Backward Euler, Ax = 1/100: 8.555 

The mean of the above values for Forward and Backward Euler can be 
taken as an "exact" reference value when convergence is studied. Hence for 
Ax = 1/30 the reference value is taken to be 8.845 and for Ax = 1/100 
it is taken to be 8.551. The two convergence plots can be found in Figure 
15.51 Note that the inclination in the left curve, the values using Ax = 1/30, 
is larger than the inclination in the right curve (Ax = 1/100). This is in 
harmony with Theorem 14.11 since it is allowed that (and good if) we have 
faster convergence for smaller Ax. 

The system (15. ip can be (and has been) solved in two steps. The first 
step gives a starting position for the second step, and may be performed 
on a coarse grid, i.e. using large Ax and At. The method is to choose an 
initial guess ^0 (a vector containing all time steps) and with it compute the 
dual, r]o, using (|5.ip . This computed 770 is used in (15. ip to compute £,upd, 
an updated ^. Using a damping v, a new state ^1 = uS^q + (1 — i^)^upd is 
computed which is used to obtain the dual r/i, which in turn is used to 
compute a new S,upd, and so on. When the difference ^upd — Cn is sufficiently 
small the iterations are terminated, and a starting point (^, rj) is obtained 
for the second step. 
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Figure 5.5. Time discretization convergence for two differ- 
ent spatial discretizations, Ax = 1/30 (left), and Ax = 1/100 
(right). The case with two propagating domain walls and 
6 = 0.03 and K = 10^. 



Step two consists of Newton iterations of (|5.ip . Since the sparse Jacobian 
can be computed explicitly this second step converges at a quadratic rate, 
making it computationally cheap to reach an accurate solution. In the ex- 
amples presented in this chapter the Newton iterations continued until the 
difference between two consecutive ^:s and ry:s was less than 10~^^ in each 
space-time component. After convergence has been reached for some dis- 
cretization, a space-time interpolation of ^ and rj can be used as a starting 
position for a Newton iteration on a new grid. It is also possible to grad- 
ually change the parameters 5 and K in the Newton iterations in order to 
be able to treat a favorite case. When the starting point is sufficiently good 
the Newton method terminates after 5-7 iteration steps, making it fast. As 
comparison, when in [15] a limited memory BFGS method is used, about 
550 iterations is needed to decrease the L2-norm of the objective gradient 
to 10^^*^, even when a clever approximation of the initial Hessian was used. 
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7. Appendix 

In order to show existence and uniqueness of solutions to ()1.4p we intro- 
duce the notion of weak solutions (see [16]). We will let (•,•) denote the 
pairing between H"^ and H^. 

Definition 7.1. We say a function 



^ G L\0,T;H^{0,1)), with ipt G ^^(0, T; ^-^(0, 1)), 
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is a weak solution of (fi.^[ j with ipo G L^(0, 1) provided 
{(pt,v) + (5(v9a;, Uj,) = (-(5" + a, 

/or each v £ Hq(0, 1) and a.e. time t^ <t <T, and 

Weak solutions are in fact more regular than is required in the defini- 
tion when the initial state € //g(0, 1), which is used when proving the 
following theorem. 

Theorem 7.2. There exists a unique weak solution ip to \l.J^ in 
C([to, r]; //q) when (^o S i?g(0, 1). This solution satisfies 

\Wx{t)\\\2 < ||(V0)x|li2+— ||v?0|li4-(J '^\Wo\\\2+5 ^||a|li2(o,T;L2) + — ' 

(7.1) 



for all t G [to,T]. 

Proof. We start by proving existence and uniqueness of solutions to the equa- 
tion (II. 4p when the potential V is used; see figure 
and (^(to) = ^0 and define (p by 

Pt = 5(pxx - 5~^V' {p) + a, ip{to) = LfQ. 

The solution then satisfies (f G L°°{to, T; Hq), so we can define a map 

A:L°^{to,T;H^)^L^ito,T;H^) 

which is single valued (see [16]). It is now shown that ^ is a contraction 
on L°°{tQ,T; Hq) if T is small enough. Let (p = A{ip) and V = ^(V")- 
Subtracting the equations for (p and tp gives 

(p - i:)t = d{p - - 5~\v'{p) - v'm, 

which entails 

ll'^ - V'llL-(to,T;Hl) < K\\V'{<p) - t^'('0)||L2(to,T;L2), 

where the constant K decreases when T decreases (see [16]). The right hand 
side in the previous inequality may be estimated as 

\\y'{^) -y'{^)\\L^{to,T;L-^) < \\y"\\L°-\W - i^\\L^(tQ,T;L^) 
< \\y"\\L'^\\'f - i^\\L2{to,T;H},) < \\y"\\L--VT - toWp - 1p\\Loo^to,T;H^)^ 

SO that A is a contraction when T is small enough. By splitting the interval 
[to, 7"] into smaller subintervals and using the contraction property on each 
such interval we obtain the existence and uniqueness of solutions to (jl.4p 
when the potential V is used. There exists a continuous representative of 
solutions to (|1.4p in the equivalence class in L°°{to,T] H^) (see [16] again) 
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which we call 99. Since the solution lives in one space dimension it is con- 
tinuous as a function of both space and time. So for each M > \ \(po\\ there 
is a time T* such that ||<^(*)llc(o,i) < ^ fo^' t < T*. Thus, in a cer- 
tain time interval the solution (p is only affected by the unchanged potential 
V (it never touches the level where V changes into V). Consider a time 
in this interval, and take the inner product with ipt in (jl.4p to get (using 

II m2 S d . . ^ ^1! ii4 ^ ^1! ii2 ^"^1! ii2 '^1! ii2 



The llv'tllia terms are dropped and the resulting inequality is integrated 
from to to T*: 

sWMTnWh + ^-^MT*)\\l - 6-'MT*)\\l, 

< ^\\i'Po)x\\l2 + —11^0^4 - S ^\\(po\\l2+ \Ml2dt. 



to 



It is now used that 

^-^MTlWh - S-'MTlWh = I' (^^^^ - ^{x,T*)')dx > -\ 
so that the previous inequality implies 

5||</'x(T*)||i. <5||(<po)x|li2 + ^||v^o|li4-ri||v^o|li2+^^ \\a\\l,dt + ]^. 

(7.2) 

By Sobolev's inequality we thereby obtain a bound on the continuous func- 
tion (p(T*) in the supremum norm. Consequently, for all controls a € 
L'^{to,T; L'^) it is possible to choose the border point s in Figure [L2] be- 
tween V and V large enough so that the solution ip is affected only by the 
unchanged potential V. Note also that it is possible to choose T* = T in 
(j7.2p . Such a solution is a weak solution to (jl.4p with the original potential 
V. It is unique in C{tQ,T; Hq), for non-uniqueness would otherwise also 
hold for some modified potential V. The error bound (17. ip follows from 

ra . □ 
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